Impact of loop statistics on the thermodynamics of RNA folding 
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Figure 1: Schematic representation of a secondary RNA struc- 
ture. Solid lines denote the RNA backbone, broken lines 
base pairs, and gray lines non-nested backbone bonds that 
are counted by the variable M; here M = 11. 
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Loops are abundant in native RNA structures and proliferate close to the unfolding transition. By 
including a statistical weight ^ £~'^ for loops of length £ in the recursion relation for the partition 
function, we show that the heat capacity depends sensitively on the presence and value of the 
exponent c, even for a short explicit tRNA sequence. For long homo- RNA we analytically calculate 
the critical temperature and critical exponents which exhibit a non-universal dependence on c. 
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Apart from its role as an information carrier, RNA 
has regulatory and catalytic abilities Since this 
RNA functionality is mostly determined by its three- 
dimensional conformation, the accurate prediction of 
RNA folding from the base sequence is a central issue [3] . 
To a fairly good approximation RNA folding can be sep- 
arated into the formation of a secondary structure, com- 
pletely determined by the enumeration of all base pairs 
present in a given sequence, and the tertiary structure 
formation which only operates on the already existing 
secondary structural elements^. This constitutes a ma- 
jor simplification compared to the protein folding prob- 
lem. Since the folding of even short RNA molecules takes 
much longer than reachable with all-atomistic simula- 
tions including explicit solvent, the more modest goal of 
obtaining the most probable secondary structures based 
on experimentally derived base-pairing and base-stacking 
free energies has been pursued 0, 

Due to the high number of unpaired bases, loops are 
abundant in RNA even at low temperatures. Polymer 
theory predicts the configurational weight of a loop con- 
sisting of £ bases to decay as £~'^ where the exponent c is 
universal and depends on the number of strands emerg- 
ing from the loop[6l|. We formulate the RNA partition 
function including the proper weight of loops using the 
same exponent c for terminal, internal, as well as multi- 
loops For a homo- RNA in the thermodynamic limit 
a folding transition is known to exist in the finite range 
2 < c < 2.479[i El El. We analytically calculate the 
c-dependent critical exponents of that transition. Crit- 
ical effects are quite small which explains why they are 
not manifest in numerical calculations [l^. On the other 
hand, the non-critical effects of varying c are pronounced, 
even for real finite-length RNA sequences. We numeri- 
cally calculate the heat capacity of a yeast tRNA with 76 
bases using experimentally determined base pairing and 
stacking free energies At low temperatures the most 
probable structure consists of a characteristic clover-leaf 
structure and thus includes a multi-loop with four he- 
lices. Neglecting the loop statistics shifts the maximum 
of the heat capacity by more than 20 K, whereas includ- 
ing a realistic exponent c gives heat capacity curves that 



agree much better in shape with experimental results 14 1 . 

A primary RNA structure is fully determined by 
the base sequence {&Ar} which is a list of nucleotides, 
bi =C,G,A or U with N entries. In agreement with 
previous treatments, a valid secondary structure is a 
list of all base pairs with the constraint that a base can 
be part of at most one pair. In addition, pseudo-knots 
are not allowed, i. e. for any two base pairs and 
{k,l) with i < j, k < I, and i < k we have either 
i<k<l<joTi<j<k< /[IS,]. The statistical weight 
of a secondary structure depends on the free energy of 
base pair formation but also on the stacking energy of 
neighboring base pairs. For two neighboring pairs 
and (i -|- 1, J — 1), the free energy containing both pairing 
and stacking is g[{bi,bj), {bi+i,bj^i)]. The statistical 
weight of a helical section starting with and ending 
with {i + hj ~ h) is = exp[-/3(5'[6j, 6^] + 

Et=i 9[{bi+h'-i,bj-h'+i)-, {bi+h' ,bj^h')] + g^[bi+h, bj-h])] 
where (3~^ = ^bT. Here g', are initialization and 
termination free energies for base pairs located at the 
helix ends. All values g, g', are extracted from 
experiments [isjl . see supplementary information [iQ] . 

In our notation, the canonical partition function Qf^j of 
a sub-strand from base i at the 5' end through j at the 
3' end depends on the number of non-nested backbone 
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bonds M0, [3, see Fig. [T] The recursion relations for 
Qfj can be written as 
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Eq. (|Ta|) describes elongation of an RNA structure by ei- 
ther adding an unpaired base (first term) or by adding an 
arbitrary sub-strand ^j^^ that is terminated by a helix. 
Eq. (jlbp constructs Q^. ^j^^ by closing structures with m 
non-nested bonds, summed up in ^ helix 

of length /i. A^ioop = 3 is the minimum number of bases 
in a terminal loop. Uf(M) and v\{M) denote the num- 
bers of configurations of a free and a looped chain with M 
links, respectively, for which we use the asyinptotic forms 
vi{M) = y^^M^-^ and v\{M) = M'" The de- 

pendence on the monomer fugacity y and the exponent 
7 drops out by introducing the rescaled partition func- 
tion Qf^- = Qf5/(y-'"'MT-i) j^^j .^^^j considered 



further. The unrestricted partition function of the entire 
RNA is given by Zn = T,m Qo^- 

The loop exponent is Cideai = 3/2 for an ideal poly- 
mer and csAW = dv ~ 1.76 for an isolated self avoid- 
ing loop with ~ 0.588 in rf = 3 dimensions 17|. 
However, helices which emerge from the loop increase c 
even further. In the asymptotic limit of long helical sec- 
tions renormalization group predicts ci = dv + ai — Icr^ 
for a loop with I emerging helices [3] where ai = 
el{2-l)/16 + e^l{l-2){8l-21)/512 + 0{e^) in an e = 4-d 
expansion. One obtains ci = 2.06 for terminal, C2 — 2.14 
for internal loops and C4 = 2.16 for a loop with four 
emerging helices. The variation of c over loop topologies 
that appear in the native structure of yeast tRNA-phe 
(shown in the inset Fig. ^ is thus quite small which 
justifies our usage of a constant exponent c for loops 
of all topologies. For larger / the e expansion predic- 
tion for c; becomes unreliable. We therefore treat c as 
a heuristic input parameter which can be thought to ac- 
count for other loop-length dependent effects (such as 
salt-dependent electrostatic loop self energies) as well. 

We implement the recursion relation, Eg. (fTl) , numer- 
ically using a free energy parameter set[l3, ll6| that al- 
lows for the wobble base pair GU in addition to the usual 
Watson-Crick pairs (GC and AU). The boundary condi- 
tions are Qf'^j = for M > j ~ i, M < 0, or j - i < 0, 
except for the initial condition Q~l_i = 1. In Fig. [5] we 
show the experimental heat capacity of the tRNA-phe of 




Figure 2: Experimental heat capacity of the tRNA-phe of 
yeast for NaCl concentrations 20 mM (triangles) and 150 mM 
(squares) [31 ■ Solid lines show results using Eq. ([T]) with 
loop exponents c = 3.0, 2.16, 1.76, (from left to right), 
compared with the results from the program RNAheat in the 
Vienna package 20] which uses a linearized multi-loop en- 
tropy for large loops (dashed curve). The dotted curve is 
obtained with c = 3 and the same energy parameter set as for 
the solid curves, except for the loop initiation penalty which 
was omitted by setting = <?*. The inset sketches the low- 
temperature secondary RNA structure obtained from Eq. ([1} , 
which perfectly matches experimental crystal structures. 



yeast compared with our predictions from Eq. ([T]) using 
C = Td^{kBTliiZN)/dT^. The heat capacity peak cor- 
responds to the gradual melting of the secondary struc- 
ture. Although the RNA consists of just 76 nucleotides 
and is therefore far from the thermodynamic limit where 
one expects asymptotic effects to be important, the loop 
exponent c has drastic effects. Increasing c from to 
3 changes the peak width and height and decreases the 
melting temperature by more than 30 K (solid lines). In 
similar studies on DNA, where loops only appear close to 
the denaturation transition, the loop exponent was found 
to have much less influence [l^. In contrast, in RNA 
structures a large fraction of bases are unpaired and the 
correct modeling of loops is more important. It is difficult 
to directly compare experimental and theoretical curves, 
as the standard energy parameters used for secondary 
RNA-structure predictions are determined at 1 M NaCl 
concentration [l3| , while experimental heat capacity data 
is only available at 20 mM and 150 mM. Although the ac- 
tual value of the loop exponent is not crucial (compare 
c = 1.76 and c = 2.16), the effect of neglecting loop 
statistics (i.e. setting c = 0) is almost as big as omitting 
the loop initiation penalty contained in 5* (dotted line), a 
well established parameter, or changing the experimental 
salt concentration. Current secondary structure predic- 
tion tools approximate the entropy for large multi-loops 
by an affine function ln(y^M-'=) ^ So + SiM 0, [2fl], 
which corresponds to a loop exponent c = 0. This is cor- 
roborated by the near agreement of the results from the 
Vienna package (20j (broken line) with the results from 
Eq. ^ using c = 0. It therefore is important to treat 
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the statistical weight of multi-loops on the same footing 
as terminal or internal loops, if they appear in the RNA 
groundstate, as is the case for tRNA. Our formulation of 
the RNA partition function can be generalized to more 
complicated loop weight functions to model the effects of 
salt or ligand binding. 

We now consider homo-RNA, which can be realized 
experimentally with synthetic alternating sequences like 
[>lJ7]Ar or [GCJat. The goal is to extract the critical 
asymptotic behavior embodied in Eq. ([T]) in the ther- 
modynamic limit. We neglect base stacking, helix initia- 
tion and termination and simply give a statistical weight 
w = exp[— £/(kBT)] to each base pair. This can be 
viewed as a coarse-graining approximation for natural or 
random RNA above the glass transition. Due to transla- 
tional invariance Eqs. (jlap and (jlbp simplify and can be 
combined to 
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where we introduced the total number of backbone seg- 
ments N = J — i of a part which ranges from base number 
i through j. Next, we introduce the generating function 
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z^s*^g*f . (3) 



For RNA with no external force one has s — 1 and 
the sum over Af, the number of non- nested backbone 
bonds, is unrestricted. In general, s — exp{—f3G{F)) > 1 
where G{F) is the change in free energy of a non- 
nested bond caused by a mechanical force F applied 
at the RNA ends. For a freely jointed chain one has 
G{F) = /3~Mn [(/3Fa)-isinh(/3Fa)], with a being the 
Kuhn length. Combining Eqs. ^ and ^ yields 
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where the function k{z) — X]?/=o ^^Q% ^^'^ grand 
canonical partition function of an RNA with paired ter- 
minal bases. From Eqs. ([3]) and ([4]) k{z) follows as the 
positive root of 



k{z){k{z) — 1) = wLic {zk{z)) 



(5) 



where we use the polylogarithm Lie (x) = X^i^i /n'^. 
The thermodynamic behavior for — > oo is determined 
by the singularity of the generating function Z{z, s) that 
is nearest to the origin in the complex z-plane. In par- 
ticular, if z* is the dominant singularity with Z{z, s) ^ 
Ci{z* — zY for z — > z* and non-integer w, then Zpf(s) ~ 

N^^'^^^'f z* and the Gibbs free energy becomes 

to leading order G = -keTlnZjv = A/'keTlnz*. 

It turns out that Z{z, s) has two singularities, first 
a branch point of k{z) that follows by differentiating 



Eq. ([5]) and whose position zy, is determined by 

n{zb)^ = wLic-i {zhK{zb)) - wLic (zb'«(zb)) . (6) 

Second, a simple pole Zp that follows from Eq. (|4]) and is 
determined by 



ZpK(Zp) = 1 . 



(7) 



The crossing of both singularities defines a critical point 
which is obtained by solving Eqs. ((S])- ^ simultaneously. 
The critical base pairing weight Wcv as a function of the 
applied force fugacity s reads in closed form 



Lic-i (s' 



■ (8) 

In Fig. [3^ we show the phase diagram of RNA in terms 
of w and s for different values of the loop exponent c. 

Let us now consider the force- free case, i. e. s — 1. 
Eq. (HI) simphfies to Wcr = (Cc-i - Cc) (Cc-i - 2Cc)~^ 
where Cc = Lie (1) is the Riemann zeta function. It im- 
mediately follows that Wci- is finite and non-zero only in 
the exponent range 2 < c < c* with c* ~ 2.479 de- 
termined by Cc*-i — 2Cc* — 0. For c — > 2 from above, 
Eq. (HI) predicts Wcr ^ 0. Thus, for c < 2, the RNA is 
always in the folded state and Zn{s) ~ N~^^'^ Zh^^ is 
characterized by the branch point Zb irrespective of how 
weak the pairing energy is [3, [llj . For c > c* the RNA 
is always unfolded and Zn{s) ~ Zp~^ and is determined 
by the simple pole. Right at the critical point, i. e. for 
2 < c < c* , Zcr — Zb ^ Zp and w — Wcr, the loop statistics 
become crucial and we obtain the new scaling 
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This gives rise to non-universal critical behavior. The 
specific heat possesses a weak non-analyticity at the 
(n-|-2)-order critical point, meaning that the v}^ deriva- 
tive with respect to temperature diverges as 
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with X = n - {3 - c)/{c - 2) for T < T„ and x = 1 
for T > Tci and n being the integer with (c — 2)^^ — 
1 < n < (c — 2)~^, see Fig. [Hb and the supplementary 
information [16]. For c — > c* we have n — > 2; for c — s- 2 
we find n ^ 00. The fraction of paired bases is obtained 
via differentiation 6 ~ dlnZ/{Nd\nw) and reads 



2Lie (zbK^Zb)) 
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for T < Tcr and 
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As before the singularity at the critical point is very weak 
and the n*'' derivative of exhibits a cusp 
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Figure 3: a) Phase diagram for three different values of the loop exponent c as a function of the base pairing weight w and force 
fugacity s featuring an unfolded phase (bottom) and a folded compact phase (top), following Eq. (|8]). The phase boundaries 
diverge at the vertical dotted lines. For c = c* ~ 2.479 the phase boundary approaches s = 1 and therefore only the unfolded 
phase exists. For c < 2 there is only the folded phase. The dots denote the unfolding transition in the absence of external 
force, i. e. s — 1, which is considered in b) and c): Temperature dependence of the b) specific heat C and c) fraction of bound 
bases S for c = 2.3. The insets show the third derivatives C" = d^C/dT^ and 0"' = d''6l/dr^ which clearly exhibit singular 
behavior. Squares denote numerical solutions of Eqs. ([5)l and ((6]) (T < Ta) or ((TJ (T > Ta). The solid lines show the leading 
order expansion around Tcr (denoted by vertical dotted lines), according to which C'" diverges with the exponent x = 2/3 for 
c = 2.3, see Eq. (|10|l . and 6"' is characterized by the exponent A — 1/3, see Eq. (|13|l . In the inset of c) the analytical result 
including the next-leading order is also shown (broken line). The horizontal broken line in c) denotes the residual fraction of 
bound pairs at infinite temperature. 



with A = (c- 2)-i - n for T < and A = 1 for T > T„, 
see Fig. |3t where these asymptotic results are compared 
with numerical solutions of Eqs. ©-([T]). 

We account for the asymptotic statistics of loops by 
including a loop-length dependent weight i^'' in the re- 
cursion relation for the partition function of RNA sec- 
ondary structures. As a function of the loop exponent 
c we obtain exact critical exponents and boundaries be- 
tween folded and unfolded phases for a simplified homo- 
RNA model. Because the folding transition is at least 
of fourth order, the singular contribution to thermody- 
namic observables such as heat capacity or fraction of 
paired bases turns out to be quite small. On the other 
hand, the non-singular contribution at temperatures well 
below criticality depends crucially on c. This is demon- 
strated for an explicit sequence of a yeast tRNA by cal- 
culating heat capacities for various values of c and com- 
paring with experimental data. It is seen that including 
realistic values for c is important and produces effects 
that are comparable to changing base pair stacking pa- 
rameters or changing the salt concentration. So the con- 
clusion is that while the dependence of critical properties 
on the loop exponent c is experimentally difficult to ac- 
cess and therefore largely irrelevant, the dependence of 
non-critical properties on c is important. 

We are currently expanding the theory to allow for 
loop exponents that depend on the actual number of he- 
lices emerging from a given loop and to include tertiary 
contacts such as pseudo-knots or base triples which have 
been shown to play an important role in RNA folding [l^. 

Support from the Elitenetzwerk Bayern within the 
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We give the coefficients of the expansion of the branch point position «b and the value Kb of the 
function i<i{z) in the vicinity of the critical point. We also list the free energy parameters used in 
our numerical study of the melting curve of the yeast tRNA-phe. 

'66 • 

o : 

O ■ EXPANSION OF THE BRANCH POINT NEAR THE CRITICAL POINT 
(N ; 

'"^ ' For T < Tcr the dominant singularity of Z{z), which determines the thermodynamics, is the branch point at z = ^b- 

i—^ , The branch point has its origin in the singular behavior of the function of k(z), which is determined by the equation 

^1 h{z){k{z) -I) =w'Lic{zk{z)) . (1) 



In this section we are considering the asymptotic behavior of the branch point position Zb (T) and the functional value 
^^-H Hh{T) '■= K(zb(r)) at the branch point in the vicinity of the phase transition, in particular for T — > Td- while T < Tcr- 
The existence of the phase transition implies for the loop exponent 2 < c < c* « 2.479. 

The equation determining the position Zh of the branch point follows by differentiating Eq. ([T|) with respect to k, 
d leading to 
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I . ^(^b) = wUc-l {z]^K{zh)) - wUc (zbK(zb)) (2) 

Ch [ where dLic (x) /dx = Lic-i (x) jx is used. For T > Tcr the pole Zp of Z(z) is the dominant singularity. It follows from 

^ ■ the equation 

ZpK(Zp) = 1 . (3) 

^ _ For the calculation of critical exponents, we always consider the force- free case characterized by s = 1 where the RNA 

■ is not stretched externally. 

(N 
m 

• ' Scaling of the partition function at T = Tc, 

O , 

] The order of the branch point exactly at T = Tcr is calculated by expanding Eq. ([T]) in powers of z/zcr — 1 and 

. k{z)/ Kcr ~ 1 while keeping w = fixed. Note that both Eq. ^ and ([3]) hold at T = Tcr. One obtains 
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Thus, the asymptotic behavior of the generating function at T = T^r is 

1 

^^^'^^'^^T^^^^-r ^c. r(i-c)j ■ 

Z{z, s) ^ Ci{z — Zcr)'^, for z Zcr and non-integer lu, implies Zn ^ N"^'^^^'' Za ^ ^ and one obtains 

Zn - iV(2-^)/(^-l)zcr-('^+^) . (6) 
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Expansion of the branch point for T < Tct 

To obtain the critical behavior for T < Tcr we perform an asymptotic expansion of Eqs. ([1]) and ([2]) around the 
critical point, where the branch point and the pole coincide. Thus, at the critical point all Eqs. (1-3) have to hold 
and we obtain the critical values exactly as 

Kcr = i f 1 + + 4wCc) , Zcr = 2 (l + ^ 1 + iwCc) ' , and Wcr = " ■ (7) 



2 V y V / (Cc-1 - 2Qr 

As an ansatz for Zh{T) and Kb(r) we use a power series in d = w/wa — 1, where w = exp[— £/(kBr)] is the 
Boltzmann weight of a base pair. To simplify notation, we define a = (c — 2)~^ and write 



Zh/zci ^ I + aid + a2d^ + . . . + d"(^aa + Ua+id + aa+2(f + ...)+ a2a-id^°' ^ + . . . (8a) 

Kh/Hcr ^l+bid+ h2d^ + ...+d"{ha+ ba+ld + ha+2(f' + ...)+ fo2a-ld^"~^ + . . . (8b) 



Plugging the ansatz ^ into Eqs. ([T]) and ([2]) we can solve order by order. To do so, the series representation of the 
polylogarithm Lii, {x) around a; = 1 is used [21 

Li. (x) ~ - C.-i(l + i(C.-2 - C.-i)(^ - 1)' + ... (1 - xf-^ (r(l - ,.) + 1(1 - u)T{l - i.)(l - x) + . . .) . (9) 

We obtain the coefficients for Zb 

ai = (10a) 

Cc-l 

a2 = 7^(2Cc-i-Cc) (10b) 

«3 = -4^(5^1 - 6Cc-iCc + 2^2) (10c) 

04 = 7y^(14Cf_i - 28C'_iCc + 20Cc-iCc - 5Cf) (lOd) 



^5 = -^mU - 120Ce'-iCc + 135dCc - 70Cc-iCc + 14Cc) (lOe) 



a„ = (lOf) 

r(i - c) + r(2 - c) ^ QU Kc-iCc + 2C^ ^ 

aa+i = 7 ^^77; T7 (lOg) 



Cc-1 V r(2 - c)Cc-i 

(r(l - c) + r(2 - c)) + 4(c - 2)Ce-iCc + (5 - 3c)C^) ^ ei-3Cc-iCc + 2C^ V+^ 

""^^ ^ I r(2-c)c-i j ^'"'^^ 



a2a-i = (lOi) 



We obtain the coefficients for Kb 

61 - 7^ (11a) 

Cc-1 

62 = -75^(Cc-i-Cc) (lib) 
63-2-|^(Cc-i-Cc)' (11c) 
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h = -5-7^(Cc-i 



i4-^(Cc-i-Ccr 

Sc-l 



(Ud) 
(lie) 



ba = - - 
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d - 3Cc^iCc + 2Cg 

r(2 - c)Cc-i 



r(i - c) + r(2 - c) / Cc-i - 3Cc-iCc + 2Cl 



2\ " + 1 
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Cc-1 V r(2 - c)Cc-i 

Cti - 3Cc-iCc + \ " C'-i - (c - 2)Cc-lCc - 



r(2-c)Cc-i ; (c-2)C_i 

Cc-i-Cc / Cc-1 - 3Cc-iCc + 2C2 



(llf) 



(llg) 



2(c-2)2.r(2-c)Cti V r(2-c)Cc-i 
X (^2(c - 2)r(l - c)(Cc-i - 2Cc)(Cc-i + 2(c - 2)Ce-iCc + (5 - 3c)C) 

+ r(2 - c)((c - 3)C!_, + {c- 3)(4c - 7)C'_iCc - (3c - 5)(4c - 9)Cc-iCc + (3c - 5)(4c - 7)C^)^ 



(Uh) 



Cc-2 - 3Cc-i + 2Cc / Cc'-i - 3Cc-lCc + 2^2 ^ ' 



(c-2)r(2-c) 



r(2 - c)Cc-i 



(Hi) 



Remarkably, the expansion of the product Zb^b yields 

^b^b - 1 + (a„ + ba)d" + (aa+i + ha,, + b^^+i + ai6„)d"+^ + 0(d"+2), (12) 

meaning that all integer powers d", for n < a, vanish. This fact renders the transition of high order when c — s- 2 and 
thus a — >• oo. For the fraction of bound bases this can be seen directly by expanding the low temperature expression 

9 = 2 Lic(^b^^b) ^^3) 

Llc-l (ZbKb) 

using Eqs. ^ and P^ . 
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ENERGY PARAMETERS 

The enthalpy h and entropy s parameters used in our calculations are taken from reference [1]. For instance, the 
entries in the row UA and the column GC, ft-uA.GC and sua,gCi give the enthalpy and entropy contribution due to 
the stacking of the two neighboring base pairs UA and GC, where U and C are located at the 5 '-end. The bottom 
two rows contain the initiation and termination contribution. For instance, the total free enthalpy of the triple helix 

3'-GCU-5' 93 = [hcCGC + ^GCAV + h^Q + ft-^u) ~ ^(sCG.GC + SGC.AU + s'cG + *Au)- 



Enthalpy h / (kcal/mol) Entropy s / (10~^kcal/(mol K)) 





AU 


UA 


CG 


GC 


GU 


UG 




AU 


UA 


CG 


GC 


GU 


UG 


AU 


-6.82 


-9.38 


-11.40 


-10.48 


-3.21 


-8.81 


AU 


-19.0 


-26.7 


-29.5 


-27.1 


-8.6 


-24.0 


UA 


-7.69 


-6.82 


-12.44 


-10.44 


-6.99 


-12.83 


UA 


-20.5 


-19.0 


-32.5 


-26.9 


-19.3 


-37.3 


CG 


-10.44 


-10.48 


-13.39 


-10.64 


-5.61 


-12.11 


CG 


-26.9 


-27.1 


-32.7 


-26.7 


-13.5 


-32.2 


GC 


-12.40 


-11.40 


-14.88 


-13.39 


-8.33 


-12.59 


GC 


-32.5 


-29.5 


-36.9 


-32.7 


-21.9 


-32.5 


GU 


-12.83 


-8.81 


-12.59 


-12.11 


-13.47 


-14.59 


GU 


-37.3 


-24.0 


-32.5 


-32.2 


-44.9 


-51.2 


UG 


-6.99 


-3.21 


-8.33 


-5.61 


-9.26 


-13.47 


UG 


-19.3 


-8.6 


-21.9 


-13.5 


-30.8 


-44.9 




7.33 


7.33 


3.61 


3.61 


7.33 


7.33 


s' 


9.0 


9.0 


-1.5 


-1.5 


9.0 


9.0 




3.72 


3.72 


0.00 


0.00 


3.72 


3.72 




10.5 


10.5 


0.0 


0.0 


10.5 


10.5 
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